Model Selection

Chart Parsing

# Chart Parsing

Hyperclovax SEED Vision Instruct 3B

HyperCLOVAX-SEED-Vision-Instruct-3B is a lightweight multimodal model developed by NAVER, featuring image-text understanding and text generation capabilities, with special optimization for Korean language processing.

naver-hyperclovax

Mlcd Vit Bigg Patch14 448

MLCD-ViT-bigG is an advanced Vision Transformer model enhanced with 2D Rotary Position Encoding (RoPE2D), excelling in document understanding and visual question answering tasks.

Text Recognition

H2ovl Mississippi 800m

An 800M-parameter vision-language model from H2O.ai, specializing in OCR and document understanding with excellent performance

Transformers English

Fuyu-8B is a multimodal text-image transformer developed by Adept AI, designed for digital agents, supporting arbitrary image resolutions with swift responses and a streamlined architecture.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase